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Burkholderia mimosarum strain LMG 23256 is an aerobic, motile. Gram-negative, non-spore- 
forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Mimosa pigra (giant 
sensitive plant). LMG 23256 was isolated from a nodule recovered from the roots of the M. pigra 
growing in Anso, Taiwan. LMG 23256 is highly effective at fixing nitrogen with M. pigra. Here we 
describe the features of B. mimosarum strain LMG 23256, together with genome sequence infor- 
mation and its annotation. The 8,410,967 bp high-quality-draft genome is arranged into 268 scaf- 
folds of 270 contigs containing 7,800 protein-coding genes and 85 RNA-only encoding genes, and is 
one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic 
Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. 



Introduction 

Members of the versatile genus Burkholderia occu- 
py a wide range of ecological niches and are found 
in soil, hospital environments, associated with 
plants either as epiphytes, endophytes or as patho- 
gens and some are endosymbionts in 
phytopathogenic fungi or plant-associated insects 
[1]. As several Burkholderia strains are known to 
exert plant-beneficial and biocontrol effects, and 
also contribute to adaptation to environmental 
stresses, there is increased interest in the use of 
Burkholderia in agriculture [1,2]. 

In addition to the different groups of rhizobia from 
the Alphaproteobacteria, a number of 
Betaproteobacteria belonging to BurMioIderia and 
Cupriavidus are now also known to be present in 
legume nodules; they are sometimes referred to as 
betarhizobia [3-5]. Several Burkholderia species 
have been described from root nodules of different 



Mimosa species: B. caribensis from M. pudica and M. 
diplotricha [4,6], B. mimosarum from M pigra and 
M. scabrella [7], B. nodosa from M. bimucronata and 
M. scabrella [8], B. phymatum from M. invisa and 
Machaerium lunatum [6,9] and B. sabiae from M. 
caesalpiniifolia [10]. Moreover, several 
Burkholderia strains have been shown to enter into 
effective symbiosis with their host [11]. 

B. mimosarum was described for a collection of iso- 
lates obtained from M. pigra in Taiwan, Venezuela 
and Brazil and one strain from M. scabrella in Brazil 
[7]. Since its first description, B. mimosarum has 
also been isolated from M. pigra nodules in China 
and Australia [12,13], from M. diplotricha in Papua 
New Guinea [14] and M. pudica in French Guiana 
[15]. M. pigra, as well as M. pudica and M. 
diplotricha, are notoriously invasive species [16]. 
M. pudica (sensitive plant) is a small South Ameri- 
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can shrub that has become a pan-tropical weed, 
while M. pigra (giant sensitive plant, black mimosa, 
prickly wood weed, catclaw mimosa) is a shrub 
that thrives in floodplains, swamps and river banks, 
where it creates dense spiny thickets [17]. M. 
diplotricha (creeping sensitive plant, nila grass, gi- 
ant sensitive plant] is a climbing shrub that scram- 
bles up other plants, quickly producing dense 
growth [18]. The success of these invasive weeds 
may in part be due to their highly effective symbi- 
otic associations. 

B. mimosarum LMG 23256T (=BCRC 17516, CCUG 
54296, NBRC 106338, PAS44) originates from 
nodules of M. pigra in Taiwan. This legume weed 
is predominantly nodulated by B. mimosarum in 
Taiwan. Other Taiwanese Mimosa species are 
nodulated mainly by Cupriavidus taiwanensis and 
it has therefore been suggested that the 
Burkholderia strains were introduced to Taiwan, 
along with the invasive M. pigra from its native 
South America, where Burkholderia strains have 
been isolated more frequently from Mimosa sp. 
than C. taiwanesis [7,19]. 

Here we present a summary classification and a 
set of features for B. mimosarum strain LMG 
23256T (Table 1), together with the description of 
the complete genome sequence and its annotation. 

Classification and features 

B. mimosarum strain LMG 23256T is a non- 
sporulating, non-encapsulated. Gram-negative rod 
within the order Burkholderiales of the class 
Betaproteobacteria. The rod-shaped form varies in 
size; it is approximately 1.0 |im in width and 2.0 
|im in length (Figure 1, Left and Figure 1, Center). 



It is fast-growing, forming colonies within 3-4 
days when grown on half strength Lupin Agar 
(VzLA) [32], tryptone-yeast extract agar (TY) [33] 
or a modified yeast- mannitol agar (YMA) [34] at 
28°C. Colonies on VzLA are white-opaque, slightly 
domed and moderately mucoid with smooth mar- 
gins (Figure 1, Right). Minimum Information 
about the Genome Sequence (MIGS) is provided in 
Table 1. Figure 2 shows the phylogenetic neigh- 
borhood of B. mimosarum strain LMG 23256T in a 
16S rRNA sequence based tree. This strain shares 
99% (1,121/1,124 bp) and 98% (1,101/1,125 bp) 
sequence identity to the 16S rRNA of the fully se- 
quenced strain B. mimosarum STM3621 (Gi08839) 
and to B. nodosa Br3461'r, respectively. 

Symbiotaxonomy 

B. mimosarum LMG 23256T was isolated from M. 
pigra growing in Anso, Taiwan and was able to 
nodulate its original host with high efficiency [19], 
as well as M. pucida and M. diplotricha [14]. LMG 
23256T was shown to outcompete other rhizobia 
to the point of exclusion for the nodulation of the 
invasive M. pigra, M. pudica and M. diplotricha un- 
der flooded conditions. This predominance was 
negatively affected by increased nitrate levels in 
the soil, which thus seems to be a factor affecting 
rhizobial competition [14]. 

With regard to other plant growth promoting 
properties, LMG 232561" displayed no antifungal 
activity against Fusarium oxysporum f. sp. phaseoli, 
did not solubilize calcium-, iron- or aluminum 
phosphates nor reduce acetylene (ARA) on the N- 
free media containing fructose, lactate or mannitol 
as sole carbon source [39]. 




Figure 1. Images of Burkholderia mimosarum strain LMG 23256^ using scanning (Left) and transmission (Center) 
electron microscopy and the appearance of colony morphology on a solid medium (Right). 
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Table 1. Classification and general features of Burkholderia mimosarum strain LMG 232 56^ 

according to the MIGS recommendations [201 



MIGS ID 


Property 


Term 


Evidence code 






Domain Bacteria 


TAS [21] 






Phylum Proteobacteria 


IAS [22] 






Class Betaproteobacteria 


TAS [23,24] 




Current classification 


Order Burkholderiales 


TAS [24,25] 






Family Burkholderiaceae 


TAS [24,26] 






Genus Burkholderia 


TAS [27-29] 






Species Burkholderia mimosarum 


TAS [7] 






Strain LMG 232 56'^ 






Gram stain 


Negative 


IDA 




Cell shape 


Rod 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Non-sporulating 


NAS 




Temperature range 


Mesophile 


NAS 




Optimum temperature 


28°C 


NAS 




Salinity 


Non-halophile 


NAS 


MIGS-22 


Oxygen requirement 


Aerobic 


TAS [19] 




Carbon source 


Varied 


NAS 




Energy source 


C he moorg a notrop h 


NAS 


MIGS-6 


Habitat 


Soil, root nodule, on host 


TAS [19] 


MIGS-15 


Biotic relationship 


Free living, symbiotic 


TAS [19] 


MIGS-14 


Pathogenicity 


Non -pathogenic 


NAS 




Biosafety level 


1 


TAS [30]* 




Isolation 


Root nodule of Mimosa pigra 


TAS [19] 


MIGS-4 


Geographic location 


Anso, Taiwan 


TAS [19] 


MIGS-5 


Soil collection date 


Not recorded 


IDA 


MIGS-4. 1 


Longitude 


120.872 2 2 


IDA 


MlGS-4.2 


Latitude 


22.28889 




MIGS-4.3 


Depth 


Not recorded 


IDA 


MIGS-4.4 


Altitude 


Not recorded 


IDA 



*Strain catalogue BCCM/LMG http://bccm.belspo.be/db/lmg_search_form.php 

Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a di- 
rect report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly ob- 
served for the living, isolated sample, but based on a generally accepted property for the spe- 
cies, or anecdotal evidence). These evidence codes are from the Gene Ontology project [31]. 
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961 Burkholderia sp. WSM4176 (HQ698909, Gi08873) 

L Burkholderia tuberum LMG 21444^ (AJ30231 1) 
- Burkholderia sp. mcas7.1 (AY528706, Gi08846) 
Burkholderia sp. JPY251 (FN543663, Gi08874) 

Burkholderia nodosa Br3437T (AY7731 89) 



Burkholderia mimosarum LMG 23256^ (AB537489, Gi08823) 



99 I Burkholderia mimosarum STM3621 (FN908408, Gi08839) 
Burkholderia sp. WSM2230 (HQ677909, Gi08831) 
Burkholderia sp. WSM2232 (HQ677910, Gi08832) 
Burkholderia sp. WSM3937 {EU219865, Gi08878) 
- Burkholderia sp. WSM3556 (HQ698908, Gi08872) 



Burkholderia sp. UYPR1.413 (JF683693, Gi08829) 
Burkholderia sabiae Br3407T (AY773186) 
— Burkholderia caribiensis MWAP64T (Y17009) 



Burkholderia phymatum LMG 21445^ {AJ302312) 
Burkholderia sp JPY366 (HM357231, Gi08876) 
100 ' Burkholderia sp. JPY347 (FN543709, Gi08875) 
- Burkholderia cepacia ATCC25416T (AF097530) 

93 r Cupriavidus taiwaner}sis LMG 19424^ (AF300324, Gc00754)* 
' Cupriavidus taiwanensis STM6018 (FN908410, Gi08840) 

100 I Cupriavidus sp. ampS (DQ530646, Gi08845) 

Cupriavidus taiwanensis STM6070 (FN908230, Gi08841) 
Cupriavidus sp UYPR2.512 (JF683703, Gi08830) 



35 n 

651— 



0.01 

Figure 2. Phylogenetic tree showing the relationship of Burkholderia mimosarum strain LMG 23256^ (shown in 
bold print) to other members of the order Burkholderiales based on aligned sequences of the 1 6S rRNA gene (1,242 
bp internal region). All sites were informative and there were no gap -containing sites. Phylogenetic analyses were 
performed using MEGA, version 5 [35]. The tree was built using the Maximum-Likelihood method with the Gen- 
eral Time Reversible model [36]. Bootstrap analysis [37] with 500 replicates was performed to assess the support of 
the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database 
accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD 
[38]. Published genomes are indicated with an asterisk. 



Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the 
basis of its environmental and agricultural rele- 
vance to issues in global carbon cycling, alternative 
energy production, and biogeochemical im- 
portance, and is part of the Community Sequencing 
Program at the U.S. Department of Energy, Joint 
Genome Institute QGI) for projects of relevance to 
agency missions. The genome project is deposited 
in the Genomes OnLine Database [38] and an im- 
proved-high-quality-draft genome sequence in 
IMG. Sequencing, finishing and annotation were 



performed by the JGI. A summary of the project in- 
formation is shown in Table 2. 

Growth conditions and DNA isolation 

B. mimosarum strain LMG 232561" cultured to 
mid logarithmic phase in 60 ml of TY rich medium 
on a gyratory shaker at 28°C [40]. DNA was isolat- 
ed from the cells using a CTAB (Cetyl trimethyl 
ammonium bromide) bacterial genomic DNA iso- 
lation method. 
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Ta ble 2. Genome sequencing project information for Burkholderia mimosarum LMG 23256^. 



MIGS ID 



Property 



Term 



MIGS-31 

MIGS-28 

MIGS-29 

MIGS-31. 2 

MIGS-30 

MIGS-32 



Finishing quality 
Libraries used 
Sequencing platforms 
Sequencing coverage 
Assemblers 
Gene calling methods 
GOLD ID 
NCBI project ID 
Database: IMG 
Project relevance 



Improved high-quality draft 
One lllumina fragment library 
lllumina HiSeq2000 
lllumina: 240x 

Velvet version 1.1.04; Allpaths-LG version r39750 

Prodigal 1 .4 

G 10882 3 

163559 

25132 37083 

Symbiotic N2 fixation, agriculture 



Genome sequencing and assembly 

The genome of B. mimosarum strain LMG 232561" 
was sequenced at the Joint Genome Institute QGI) 
using lllumina technology [41]. An lllumina standard 
shotgun library was constructed and sequenced us- 
ing the lllumina HiSeq 2000 platform, which gener- 
ated 14,635,038 reads totaling 2,014 Mbp. 

All general aspects of library construction and se- 
quencing performed at the JGI can be found at 
http : / / my .j gi.do e. gov /general/index. html . All raw 
lllumina sequence data was passed through DUK, a 
filtering program developed at JGI, which removes 
known lllumina sequencing and library prepara- 
tion artifacts (Mingkun, L, Copeland, A. and Han, J., 
unpublished). The following steps were then per- 
formed for assembly: (1) filtered lllumina reads 
were assembled using Velvet [42] (version 1.1.04), 
(2) 1-3 Kbp simulated paired end reads were cre- 
ated from Velvet contigs using wgsim [43], (3) 
lllumina reads were assembled with simulated 
read pairs using Allpaths-LG [44] [version 
r39750). Parameters for assembly steps were: 

1 ) Velvet (-V -s 51 --e 71 -i 2 -t 1 -f "- 

shortPaired -fastq $FASTQ" — o "- 
ins_length 250 -min_contig_lgth 
500") 10) 

2) wgsim (-6 0 -1 76 -2 75 -r 0 -R 0 -X 0) 

3) Allpaths-LG 

(PrepareAllpathslnputs:PHRED64=1 

PLOIDY=1 FRAGCOVERAGE=12 5 

JUMPCOVERAGE=25 

LONGJUMPCOV=50, RunAllpath- 

sLG: THREADS=8 

RU N=stdshredpairs TAR- 

GETS=standard 

VAPIWARNONLY=True OVER- 
WRITE=True). 



The final draft assembly contained 270 contigs in 
268 scaffolds. The total size of the genome is 8.4 
Mbp and the final assembly is based on 2,014 Mbp 
of lllumina data, which provides an average 240x 
coverage of the genome. 

Genome annotation 

Genes were identified using Prodigal [45] as part of 
the DOE -JGI annotation pipeline [46]. The predicted 
CDSs were translated and used to search the Na- 
tional Center for Biotechnology Information (NCBI) 
nonredundant database, UniProt, TIGRFam, Pfam, 
PRIAM, KEGG, COG, and InterPro databases. The 
tRNAScanSE tool [47] was used to find tRNA genes, 
whereas ribosomal RNA genes were found by 
searches against models of the ribosomal RNA 
genes built from SILVA [48]. Other non-coding 
RNAs such as the RNA components of the protein 
secretion complex and the RNase P were identified 
by searching the genome for the corresponding 
Rfam profiles using INFERNAL [49]. Additional 
gene prediction analysis and manual functional an- 
notation was performed within the Integrated Mi- 
crobial Genomes (IMG-ER) platform [50]. 

Genome properties 

The genome is 8,410,967 nucleotides 63.89% GC 
content (Table 3) and comprised of 268 scaffolds 
(the four largest scaffolds are shown in Figures 3a, 
3b, 3c and Figure 3d) of 270 contigs. From a total of 
7,885 genes, 7,800 were protein encoding and 85 
RNA only encoding genes. The majority of genes 
(75.13%) were assigned a putative function whilst 
the remaining genes were annotated as hypothet- 
ical. The distribution of genes into COGs functional 
categories is presented in Table 4. 
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Table 3. Genome Statistics for B. mimosarum strain LMG 23256^ 



Attribute 


Value 


% of Total 


Genome size (bp) 


8,410,967 


100.00 


DNA coding region (bp) 


7,084,1 75 


84.23 


DNA G+C content (bp) 


5,373,761 


63.89 


Number of scaffolds 


268 




Number of contigs 


270 




Total gene 


7,885 


100.00 


RNA genes 


85 


1.08 


rRNA operons* 


1 


0.01 


Protein-coding genes 


7,800 


98.92 


Genes with function prediction 


5,924 


75.13 


Genes assigned to COGs 


5,870 


74.45 


Genes assigned Pfam domains 


6,242 


79.16 


Genes with signal peptides 


673 


8.54 


Genes with transmembrane helices 


1,680 


21.31 


CRISPR repeats 


0 





*5 copies of 5S, 1 copy of 1 6S and 2 copies of 23S rRNA. 



Table 4. Number of prote 


in codinj 


TT rro noc r^f R kki i'iti /^c/^ Vl 1 YYi ctro i n 1 KA C^. '? '^'7 ^ n cc r\n'\ a/A \A/lfK 
^ y CI Itrb Ul D. iIlUIIUjLIi Hill bird 1 M L/Vl vj Z jZ DD dbbUL. 1 d LcU Wl Ll I 


the general 


COG functional categ 


Ul ICS. 


Code 


Value 


%aee 


D^Gr'i'intinn 


1 


191 


2.89 


Tfdnsldtion, ribosomdl structure dnd biogenesis 


A 


5 


0.09 


RNA processing dnd modification 




588 


8.89 


Transcription 


L 


415 


6.28 


r> 1 • i' 1 ' i' 1 

Replication, recombination and repair 


B 


2 


0.03 


Chromatin structure and dynamics 


D 


50 


0.76 


Cell cycle control, mitosis and meiosis 


Y 


0 


0.00 


Nuclear structure 


V 


71 


1.07 


Defense mechanisms 


T 


376 


5.69 


Signal transduction mechanisms 


M 


414 


6.2 6 


Cell wall/membrane biogenesis 


N 


146 


2.21 


Cell motility 


Z 


0 


0.00 


Cytoskeleton 


W 


0 


0.00 


Extracellular structures 


u 


161 


2.43 


Intracellulartrafficking and secretion 


o 


208 


3.15 


Posttranslational modification, protein turnover, chaperones 


c 


489 


7.39 


Energy production conversion 


G 


435 


6.58 


Carbohydrate transport and metabolism 


E 


623 


9.42 


Amino acid transport metabolism 


F 


98 


1.48 


Nucleotide transport and metabolism 


H 


226 


3.42 


Coenzyme transport and metabolism 


1 


316 


4.78 


Lipid transport and metabolism 


P 


293 


4.43 


Inorganic ion transport and metabolism 


Q 


231 


3.49 


Secondary metabolite biosynthesis, transport and catabolism 


R 


745 


11.27 


General function prediction only 


S 


529 


8.00 


Function unknown 




2,015 


25.55 


Not in COGS 




6,612 




Total 
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Figure 3a. Graphical map of LMG 23256_A1 9UDRAFT_scaffold_0.1 of the B. 
mimosarum strain LMG 2 3256^ genome. From bottom to the top of each scaf- 
fold: Genes on forward strand (color by COG categories as denoted by the IMG 
platform), Genes on reverse strand (color by COG categories), RNA genes 
(tRNAs green, sRNAs red, other RNAs black), GC content, GC skew. 




Figure 3b. Graphical map of LMG 23256_A1 9U DRAFT_scaffold_l .2 of the B. 
mimosarum strain LMG 23256^ genome. From bottom to the top of each scaffold: 
Genes on forward strand (color by COG categories as denoted by the IMG plat- 
form). Genes on reverse strand (color by COG categories), RNA genes (tRNAs 
green, sRNAs red, other RNAs black), GC content, GC skew. 




Figure 3c. Graphical map of LMG 232 56_A19UDRAFT_scaffold_2.3 of the B. 
mimosarum strain LMG 23256^ genome. From bottom to the top of each scaffold: 
Genes on forward strand (color by COG categories as denoted by the IMG plat- 
form). Genes on reverse strand (color by COG categories), RNA genes (tRNAs 
green, sRNAs red, other RNAs black), GC content, GC skew. 
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Figure 3d. Graphical map of LMG 232 56_A19UDRAFT_scaffolcl_3.4 of the B. 
mimosarum strain LMG 23256^ genome. From bottom to the top of each scaffold: 
Genes on forward strand (color by COG categories as denoted by the IMG plat- 
form), Genes on reverse strand (color by COG categories), RNA genes (tRNAs 
green, sRNAs red, other RNAs black), GC content, GC skew. 
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